Goto

Collaborating Authors

 family history


What's my Alzheimer's risk, and can I really do anything to change it?

New Scientist

What's my Alzheimer's risk, and can I really do anything to change it? Can you escape your genetic inheritance, and do lifestyle changes actually make a difference? Daniel Cossins set out to understand what the evidence on Alzheimer's really means for him A few years ago, my dad was diagnosed with Alzheimer's disease, just like his older brother and his mum before him. Slowly, his personality began to ebb away. Now, at the age of 75, his cognitive decline is accelerating: he no longer recognises his granddaughters, for instance, and he lives in a near-constant state of confusion, which means he is losing his independence, too. As I process this loss and try to support my parents, I have become increasingly curious about what my family history means for me.


Machine Learning Meets Transparency in Osteoporosis Risk Assessment: A Comparative Study of ML and Explainability Analysis

Elias, Farhana, Reza, Md Shihab, Mahmud, Muhammad Zawad, Islam, Samiha, Alve, Shahran Rahman

arXiv.org Artificial Intelligence

The present research tackles the difficulty of predicting osteoporosis risk via machine learning (ML) approaches, emphasizing the use of explainable artificial intelligence (XAI) to improve model transparency. Osteoporosis is a significant public health concern, sometimes remaining untreated owing to its asymptomatic characteristics, and early identification is essential to avert fractures. The research assesses six machine learning classifiers: Random Forest, Logistic Regression, XGBoost, AdaBoost, LightGBM, and Gradient Boosting and utilizes a dataset based on clinical, demographic, and lifestyle variables. The models are refined using GridSearchCV to calibrate hyperparameters, with the objective of enhancing predictive efficacy. XGBoost had the greatest accuracy (91%) among the evaluated models, surpassing others in precision (0.92), recall (0.91), and F1-score (0.90). The research further integrates XAI approaches, such as SHAP, LIME, and Permutation Feature Importance, to elucidate the decision-making process of the optimal model. The study indicates that age is the primary determinant in forecasting osteoporosis risk, followed by hormonal alterations and familial history. These results corroborate clinical knowledge and affirm the models' therapeutic significance. The research underscores the significance of explainability in machine learning models for healthcare applications, guaranteeing that physicians can rely on the system's predictions. The report ultimately proposes directions for further research, such as validation across varied populations and the integration of supplementary biomarkers for enhanced predictive accuracy.


From Conversation to Query Execution: Benchmarking User and Tool Interactions for EHR Database Agents

Lee, Gyubok, Chay, Woosog, Kwak, Heeyoung, Kim, Yeong Hwa, Yoo, Haanju, Jeong, Oksoon, Son, Meong Hi, Choi, Edward

arXiv.org Artificial Intelligence

Despite the impressive performance of LLM-powered agents, their adoption for Electronic Health Record (EHR) data access remains limited by the absence of benchmarks that adequately capture real-world clinical data access flows. In practice, two core challenges hinder deployment: query ambiguity from vague user questions and value mismatch between user terminology and database entries. To address this, we introduce EHR-ChatQA an interactive database question answering benchmark that evaluates the end-to-end workflow of database agents: clarifying user questions, using tools to resolve value mismatches, and generating correct SQL to deliver accurate answers. To cover diverse patterns of query ambiguity and value mismatch, EHR-ChatQA assesses agents in a simulated environment with an LLM-based user across two interaction flows: Incremental Query Refinement (IncreQA), where users add constraints to existing queries, and Adaptive Query Refinement (AdaptQA), where users adjust their search goals mid-conversation. Experiments with state-of-the-art LLMs (e.g., o4-mini and Gemini-2.5-Flash) over five i.i.d. trials show that while agents achieve high Pass@5 of 90-95% (at least one of five trials) on IncreQA and 60-80% on AdaptQA, their Pass^5 (consistent success across all five trials) is substantially lower by 35-60%. These results underscore the need to build agents that are not only performant but also robust for the safety-critical EHR domain. Finally, we provide diagnostic insights into common failure modes to guide future agent development.


AI could save your life! A 400 15-minute full-body scan to detect the earliest signs of cancer is on the horizon thanks to artificial intelligence

Daily Mail - Science & tech

Most people spend their lunch breaks grabbing a sandwich or going for a walk. But soon it could be possible to get a full-body MRI scan which detects the earliest stages of cancer during your lunch hour, thanks to AI. Health tech pioneer Ezra has launched its screening service in the UK, marking a major expansion beyond the US. Their AI-powered scans currently last an hour and cover 13 organs, with the added option of an extra lung CT scan and heart disease screening. As cancer rates are rising – especially among young people – the company say they are the best defence against the disease. With early detection, treatment can start earlier and prognosis improves dramatically.


Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond

Sanni, Mardhiyah, Abdullahi, Tassallah, Kayande, Devendra D., Ayodele, Emmanuel, Etori, Naome A., Mollel, Michael S., Yekini, Moshood, Okocha, Chibuzor, Ismaila, Lukman E., Omofoye, Folafunmi, Adewale, Boluwatife A., Olatunji, Tobi

arXiv.org Artificial Intelligence

Speech technologies are transforming interactions across various sectors, from healthcare to call centers and robots, yet their performance on African-accented conversations remains underexplored. We introduce Afrispeech-Dialog, a benchmark dataset of 50 simulated medical and non-medical African-accented English conversations, designed to evaluate automatic speech recognition (ASR) and related technologies. We assess state-of-the-art (SOTA) speaker diarization and ASR systems on long-form, accented speech, comparing their performance with native accents and discover a 10%+ performance degradation. Additionally, we explore medical conversation summarization capabilities of large language models (LLMs) to demonstrate the impact of ASR errors on downstream medical summaries, providing insights into the challenges and opportunities for speech technologies in the Global South. Our work highlights the need for more inclusive datasets to advance conversational AI in low-resource settings.


Assessing Alcohol Use Disorder: Insights from Lifestyle, Background, and Family History with Machine Learning Techniques

Wang, Chenlan, Huang, Gaojian, Luo, Yue

arXiv.org Artificial Intelligence

This study explored how lifestyle, personal background, and family history contribute to the risk of developing Alcohol Use Disorder (AUD). Survey data from the All of Us Program was utilized to extract information on AUD status, lifestyle, personal background, and family history for 6,016 participants. Key determinants of AUD were identified using decision trees including annual income, recreational drug use, length of residence, sex/gender, marital status, education level, and family history of AUD. Data visualization and Chi-Square Tests of Independence were then used to assess associations between identified factors and AUD. Afterwards, machine learning techniques including decision trees, random forests, and Naive Bayes were applied to predict an individual's likelihood of developing AUD. Random forests were found to achieve the highest accuracy (82%), compared to Decision Trees and Naive Bayes. Findings from this study can offer insights that help parents, healthcare professionals, and educators develop strategies to reduce AUD risk, enabling early intervention and targeted prevention efforts.


Conversational Disease Diagnosis via External Planner-Controlled Large Language Models

Sun, Zhoujian, Luo, Cheng, Liu, Ziyi, Huang, Zhengxing

arXiv.org Artificial Intelligence

The development of large language models (LLMs) has brought unprecedented possibilities for artificial intelligence (AI) based medical diagnosis. However, the application perspective of LLMs in real diagnostic scenarios is still unclear because they are not adept at collecting patient data proactively. This study presents a LLM-based diagnostic system that enhances planning capabilities by emulating doctors. Our system involves two external planners to handle planning tasks. The first planner employs a reinforcement learning approach to formulate disease screening questions and conduct initial diagnoses. The second planner uses LLMs to parse medical guidelines and conduct differential diagnoses. By utilizing real patient electronic medical record data, we constructed simulated dialogues between virtual patients and doctors and evaluated the diagnostic abilities of our system. We demonstrated that our system obtained impressive performance in both disease screening and differential diagnoses tasks. This research represents a step towards more seamlessly integrating AI into clinical settings, potentially enhancing the accuracy and accessibility of medical diagnostics.


Identifying Health Risks from Family History: A Survey of Natural Language Processing Techniques

Dai, Xiang, Karimi, Sarvnaz, O'Callaghan, Nathan

arXiv.org Artificial Intelligence

Electronic health records include information on patients' status and medical history, which could cover the history of diseases and disorders that could be hereditary. One important use of family history information is in precision health, where the goal is to keep the population healthy with preventative measures. Natural Language Processing (NLP) and machine learning techniques can assist with identifying information that could assist health professionals in identifying health risks before a condition is developed in their later years, saving lives and reducing healthcare costs. We survey the literature on the techniques from the NLP field that have been developed to utilise digital health records to identify risks of familial diseases. We highlight that rule-based methods are heavily investigated and are still actively used for family history extraction. Still, more recent efforts have been put into building neural models based on large-scale pre-trained language models. In addition to the areas where NLP has successfully been utilised, we also identify the areas where more research is needed to unlock the value of patients' records regarding data collection, task formulation and downstream applications.


How FamilySearch is using the future to discover the past with AI - Deseret News

#artificialintelligence

FamilySearch has made more than 2.6 billion historical resources available to the public, and according to John Alexander who is a senior product manager there, there's a lot more on the way. More than 5 billion more documents -- collected and converted to digital images -- need to be transcribed to make them searchable and usable in FamilySearch's database. And 1 to 2 million more are added every single day. With the development of new artificial intelligence technology, there's more hope of getting billions of records to families looking for information about their relatives in as little as five years. And it's already being tested and used.


AI reveals link between family history and type 1 diabetes risks - Futurity

#artificialintelligence

You are free to share this article under the Attribution 4.0 International license. A new data-driven approach is offering insight into people with type 1 diabetes, who account for about 5-10% of all diabetes diagnoses. The researchers gathered information through health informatics and applied artificial intelligence (AI) to better understand the disease. In the study, they analyzed publicly available, real-world data from about 16,000 participants enrolled in the T1D Exchange Clinic Registry. By applying a contrast pattern mining algorithm, researchers were able to identify major differences in health outcomes among people living with type 1 diabetes who do or do not have an immediate family history of the disease.